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The two surface glycoproteins S and HE of human coronavirus OC43 (HCV-OC43) were isolated from the viral 
Membrane and purified. Only the S$ protein was able to agglutinate chicken erythrocytes, indicating that this viral 
protein is the major hemagglutinin of HCV-OC43. The receptor determinant recognized by this virus on the surface of 
erythrocytes is N-acetyl-9-O-acetylneuraminic acid {Neu5,9Ac,) which is also used by bovine coronavirus for attach- 
ment to cells. By analyzing erythrocytes containing different amounts of Neu5,9Ac, in either of two linkage types, it was 
found that there are subtle differences in the affinity of both viruses for 9-O-acetylated sialic acid. Bovine coronavirus 
was more efficient in recognizing low amounts of Neu5,9Ac, «2,3 linked to galactose, whereas HCV-0C43 was supe- 
rior with respect to the «2,6 linkage. The gene coding for the S protein of HCV-OC43 was cloned and sequenced. A 
large open reading frame predicts a polypeptide of 150 kDa in the unglycosylated form. A protein of about 190 kDa is 
expected if the 20 potential glycosylation sites are used for attachment of N-linked oligosaccharide side chains. These 
predictions were confirmed by jn vitro transcription and translation of the gene in the presence or absence of canine 
pancreatic microsomal membranes. A high degree of sequence homology was found between the S proteins of HCV- 
OC43 and bovine coronavirus. Structural and functional analyses of more strains should help to identify the location of 


the sialic acid-binding site. © 1993 Academic Press, Inc. 


INTRODUCTION 


Coronaviruses are a family of large, enveloped vi- 
ruses, which contain a single-stranded RNA genome 
of positive polarity with a size of about 30 kilobases. 
The virion contains two to three structural glycopro- 
teins: the membrane protein (M); the spike, surface, or 
peplomer protein (S); and the hemagglutinin/esterase 
protein (HE). The presence of a HE protein is a charac- 
teristic feature of a serological subgroup of coronavi- 
ruses comprising human coronavirus (HCV) strain 
OC43, bovine coronavirus (BCV), hemagglutinating en- 
cephalomyelitis virus (HEV), and mouse hepatitis virus 
(MHV). Coronaviruses, which lack an HE protein and 
which have no antigenic relationship to the above- 
mentioned viruses, include human coronavirus HCV- 
229E, porcine transmissible gastroenteritis virus 
(TGEV), and avian infectious bronchitis virus (IBV) (for a 
review see Siddell et a/., 1983; Spaan et a/., 1988). 
Human coronaviruses are known as respiratory patho- 
gens; in addition, they have been associated with diar- 
rhea and multiple sclerosis (McIntosh, 1974; Resta et 
al/., 1985; Burks et a/., 1980). 

The S protein is a typical class | membrane protein, 
with an amino-terminal signal peptide, which is 
cleaved during protein processing and a carboxy-ter- 
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minal hydrophobic membrane anchor. The spike pro- 
teins of BCV, MHV, and IBV are cleaved by an intracel- 
lular protease into two subunits of comparable size. 
Furthermore the S protein is expected to have an over- 
all structure similar to the hemagglutinin of influenza A 
virus (Wilson et af, 1981; De Groot et a/., 1987; Ras- 
schaert and Laude, 1987). The S protein is a major 
target of the cellular immune response to coronavir- 
uses and plays an important role in the initial stage of 
infection. It mediates the attachment of the virus to the 
cell surface receptors and induces the fusion of the 
viral and cellular membranes (reviewed by Spaan et a/., 
1988). 

Different types of virus receptors have been identi- 
fied for coronaviruses. A member of the carcinoem- 
bryonic antigen family of proteins is a receptor for MHV 
{Williams et a/., 1991); TGEV and HCV-229E recognize 
aminopeptidase N as a receptor (Delmas et a/., 1992; 
Yeager et a/., 1992). BCV uses N-acetyl-9-O-acetyl- 
neuraminic acid as a receptor determinant to initiate 
infection (Schultze and Herrler, 1992). Results ob- 
tained with erythrocytes indicate, that the attachment 
of HCV-OC43 and HEV is also dependent on the pres- 
ence of NeuS,9Ac, on the cell surface (Vlasak et a/., 
1988; Schultze et a/., 1990). Although the HE protein 
has an affinity for 9-O-acetylated sialic acid, too, the S 
protein has been shown to be the major sialic acid- 
binding protein of BCV (Schultze et a/., 1991b). 

In the present report we show that the S protein is 
also the major sialic acid-binding protein of HCV- 
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OC43. The spike protein was further analyzed (i) by 
determining the nucleotide sequence of the corre- 
sponding gene, (ii) by in vitro transcription and transla- 
tion of the cloned gene, and (iii) by analyzing the effi- 
ciency in recognizing Neu5,9Ac,. 


MATERIALS AND METHODS 
Viruses and cells 


HCV-0C43 was obtained from the American Type 
Culture Collection (ATCC) and passaged three times in 
human rectal tumor cells (HRT-18). Three days p.i. 
hemagglutination titers of 256 HAU/m! were deter- 
mined in the supernatant using 0.5% chicken erythro- 
cytes for HA-titration (Schultze et a/., 1990). HRT-18 
cells were obtained from ATCC and grown in Dul- 
becco’s minimal essential medium supplemented with 
10% fetal calf serum. 


Purification of virus 


Two to 4 days p.i. supernatants of infected cells 
were harvested and cenirifuged for 30 min at 8,000 
rpm and 4°. Virus was pelleted from the clarified me- 
dium by centrifugation for 2.5 hr at 25,000 rpm and 4° 
using an SW28 rotor. The virus pellet was homoge- 
nized in phosphate-buffered saline deficient in magne- 
sium and calcium ions (PBS def.) by forcing it through a 
syringe needle and loaded on a discontinuous sucrose 
gradient consisting of 60, 50, and 30% sucrose in PBS 
def. After centrifugation for 1.5 hr at 40,000 rpm and 4° 
in an SW41 rotor, the virus was collected at the 30— 
50% interface. The virus was diluted with PBS def. and 
pelleted for 1 hr at 40,000 rpm and 4° in an SW41 rotor. 


Purification of viral glycoproteins 


The glycoproteins S and HE were isolated from the 
viral membrane by treatment with octylglucoside and 
purified by sucrose gradient centrifugation as de- 
scribed recently for BCV (Schultze et a/., 1991a,b). The 
glycoproteins were analyzed for HA activity and acetyl- 
esterase activity as described recently (Schultze et a/., 
1990, 1991b). 


Resialylation of erythrocytes 


Neu5,9Ac, was attached to cell surface glycoconju- 
gates in two linkage types by incubating erythrocytes 
from 1-day-old chicken with CMP-activated sialic acid 
and either of two sialyltransferases (Galé1,3GalNAc 
a2,3-sialyltransferase or Gal8t,4GIcNAc @2,6-sialyl- 
transferase, respectively; Boehringer-Mannheim) as 
described recently (Schultze et a/., 1990, 1992). 


Preparation of viral RNA 


The pellet of purified virus was homogenized in TE 
buffer, pH 7.6, and subjected to an overnight protein- 
ase K digestion (Sambrook et a/., 1990). Viral RNA was 
extracted twice with phenol/chloroform 1/1 and twice 
with chloroform/isoamylalcohol 24/1 followed. The up- 
per phase was precipitated by 3 M NaAc, pH 5.3, and 
EtOH at —20° for 48 hr. After pelleting and washing 
with 70% EtOH, vRNA was dissolved in DEPC-treated 
water. 


DNA sequencing 


The cDNA of the cloned HCV-OC43 S gene was se- 
quenced with a modification of the dideoxy chain termi- 
nation method (Sanger et a/., 1977) using Sequenase 
2.0 (USB). Sequencing primers were between 18 and 
26 nucleotides in length and were synthesized with an 
oligonucleotide synthesizer (Applied Biosystems). Part 
of them were designed on the basis of the published S 
gene sequence for BCV strain Mebus (Abraham et a/., 
1990). The reaction products were analyzed on 5 and 
6% acrylamide gels containing 7 M urea. 


Cloning of the S gene 


The vRNA was transcribed into DNA using MoMuLV 
reverse transcriptase (Boehringer-Mannheim). Be- 
cause of the inability of the enzyme to produce a full- 
length cDNA, the S gene was transcribed in two over- 
lapping parts (Fig. 1). Primer Reb-? binds to nucleo- 
tides 2244-2264 and Rbx-2 to the 3-end of the S gene. 
Both of them contain at the 5’-end a BamHI linker, Rbx- 
2 an additional Xhol linker (Reb-1 5' CGCGGATCC- 
TACCTACTGTGAGATCACATG 3% Rbx-2 5’ CCTC- 
GAGGGATCCCACGAACTTAGTCGTCATGTG 33). 
First-strand cDNA synthesis was carried out in a vol- 
ume of 20 ul containing 10 mM DTT, 6 mM MgCl, 1 
ug VRNA, 100 pg/ml BSA, 50 mM Tris-HCl, pH 8.3, 40 
mM KCI, dNTP at 1 mM each, 0.75 uM primer, 10 U 
MoMuLV reverse transcriptase. Denaturation for 3.5 
min was followed by annealing for 30 min at room tem- 
perature. Polymerization was carried out at 42° for 1 
hr. Afterward the enzyme was denatured by incubation 
at 99° for 7 min. This step was followed by PCR using 
the Gene Amp Kit and a thermal cycler (Perkin-Elmer 
Cetus) according to the manufacturer’s instructions. 

The sense primers were used at 0.15 uM and con- 
tain at their 5'-end a BamHl linker (Pbam-1) or an Xhol 
linker (Pxho-2), respectively. (Pbam-1: 5’ CGCGGA- 
TCCGCTGCATGATGCTTAGACCA = 3’, Pxho-2: 5’ 
CCTCGAGGACCAGCATTGCTATTTCGGAATAT 3'}. 
There were 25 cycles performed each consisting of 
denaturation for 1 min at 95°, annealing for 1 min at 
59° and a 3 min extension step at 72°. In each cycle 
the polymerization step was automatically extended 
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Fig. 1. Flow chart of the procedure chosen for cloning the S gene 
of HCV-0C43. 


for 2 sec. Five independent reverse transcription and 
PCR reactions were done and mixed afterward. To re- 
move the Taq polymerase which might have been 
bound to the reaction products (Crowe et a/., 1991), 
the whole mixture was digested with proteinase K (17 
ug/ml) for 1 hr at 37°, extracted with phenol/chloro- 
form/isoamylalcohol (50/48/2), and precipitated. After 
digestion with BamHI or Xhol, respectively, the prod- 
ucts were separated on a 1% agarose gel and eluted 
using Quiaex (Diagen). The two S gene fragments were 
subcloned into pBluescript (Stratagene). Strain XI-1 
Blue (Stratagene) of Escherichia coli was transformed 
with this plasmid using standard procedures. The two 
overlapping inserts were prepared by digesting the re- 
combinant plasmids with either BamHl or Xhol, respec- 
tively, followed by agarose gel electrophoresis and elu- 
tion. They were dephosphorylated with 8 units calf in- 
testinal alkaline phosphatase (Boehringer-Mannheim) 
for 1 hr at 37°, extracted with phenol/chloroform/iso- 
amylalcohol, and precipitated. Then both fragments 
were blunt-ended with Pvull (Fig. 1) and subjected a 
second time to electrophoresis and elution. Both frag- 
ments were ligated with 12.5 units T4-ligase (Boeh- 
ringer-Mannheim) for 12 hr at 14°. An aliquot of this 
ligation mixture was incubated with BamHl- and Xhol- 


digested pBluescript and 5 units T4-ligase at 16° for an 
additional 12 hr. After transformation, three positive 
clones were sequenced at both ends of the S gene 
(300 nucleotides in both directions). No nucleotide dif- 
ference was found. One of the positive clones was se- 
quenced completely in both directions. 


In vitro transcription and translation 


The recombinant plasmid pBSKS*/atS8 was linear- 
ized with Xhol, digested with proteinase K, extracted 
with phenol/chloroform, precipitated, and dissolved in 
DEPC-treated TE-buffer, pH 7.5, at 1 po/pul. in vitro 
transcription was done in a total volume of 25 ul, using 
2 yg of linearized plasmid DNA and the mCAP kit (Stra- 
tagene), which yielded capped mRNAs. These mRNAs 
were subsequently treated for 5 min at 37° with 10 
units RNase-free DNasel and then extracted once with 
phenol/chloroform. After precipitation the mRNA was 
resuspended in TE buffer. /n vitro translation was per- 
formed with a commercial kit (IN VITRO EXPRESS, 
Stratagene), which consisted of a pretreated white rab- 
bit reticulocyte lysate. The reaction was performed ac- 
cording to the manufacturer's instructions with 1-3 ng 
mRNA and *°S-methionine. To obtain proteins which 
were processed and core-glycosylated, translation re- 
actions were done using canine pancreatic micro- 
somal membranes (Promega). The a-mating factor 
from S. cerevisiae served as a control for signal pro- 
cessing and the B-lactamase from E. coli as a control 
for glycosylation (data not shown). 


Immunoprecipitation 


After translation, the proteins were immunoprecipi- 
tated with a rabbit antisera directed against the S pro- 
tein of BCV. Incubation for 1 hr on ice was followed by 
incubation for 30 min at 4° with protein A-Sepharose 
(Serva) on a shaker and three cycles of washing at 4° 
with PBS def. Afterward, the immunoprecipitate was 
resuspended in sample buffer containing 100 mM 
DTT, heated to 95°, subjected to SDS-PAGE on 8% 
gels (Laemmii, 1970), and prepared for fluorography. 


RESULTS 
The S$ protein of HCV-OC43 is a hemagglutinin 


In order to assign the hemagglutinating activity of 
HCV-0C43 to one of the surface proteins, the glyco- 
proteins were isolated in purified form. The membrane 
proteins were solubilized by detergent treatment and 
separated by sucrose gradient centrifugation as de- 
scribed recently for BCV (Schultze et a/., 1991a,b). A 
peak of the S protein was detected in fraction 3, 
whereas HE was recovered from fraction 6. A similar 
sedimentation behavior has been reported for the gly- 
coproteins of BCV (Schultze et a/., 1991b). These frac- 
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TABLE 1 


ANALYSIS OF THE GLYCOPROTEINS S ANO HE oF HCV-OC43 For 
ACETYLESTERASE AND HEMAGGLUTINATING ACTIVITY 


HA activity Acetylesterase 
Glycoprotein? (HA-units/mi} (OD405) 
S protein 1024 <0.01 
HE protein <2 0.56 


* The glycoproteins of HCV-OC43 were purified by sucrose gra- 
dient centrifugation. Fractions 3 and 6 representing the peak frac- 
tions of S and HE, respectively, were dialyzed and analyzed for HA 
and esterase activity. 


tions were analyzed for acetylesterase and hemaggluti- 
nating activity. As shown in Table 1, only fraction 6 was 
able to cleave p-nitrophenyl acetate, confirming that 
the HE protein is the esterase of HCV-OC43. This pro- 
tein was, however, unable to agglutinate chicken eryth- 
rocytes. On the other hand, the S protein present in 
fraction 3 was found to be a very potent agglutinating 
agent. This result indicates that the S protein is the 
major hemagglutinin of HCV-OC43. This finding is in 
agreement with the data reported for BCV (Schultze et 
al., 1991b). A difference between both viruses was ob- 
served in the electrophoretic behavior of the S protein. 
While the surface protein of BCV is cleaved by a cellu- 
lar protease into two subunits, $1 and S2, the glyco- 
protein of HCV-OC43 is present on virions in the un- 
cleaved form (not shown). As has been shown previ- 
ously, cleavage is possible by incubation with trypsin 
(Hogue and Brian, 1986). 


Comparison of the sialic acid-binding activity of 
HCV-0C43 and BCV 


The sensitivity of the erythrocyte receptors for HCV- 
0C43 to acetylesterase suggested that 9-O-acetylated 
sialic acid is a receptor determinant for this virus (Vla- 
sak et a/., 1988). Therefore, it can be concluded from 
the data presented in Table 1 that the S protein is not 
only the major hemagglutinin but also the major sialic 
acid-binding protein. Direct evidence for the impor- 
tance of Neu5,9Ac, was obtained by resialylation of 
erythrocytes from 1-day-old chicken. These cells lack 
9-O-acetylated sialic acid and are resistant to agglutina- 
tion by HCV-OC43. Neu5,9Ac, was attached to the 
cell surface by incubation of neuraminidase-treated 
erythrocytes with sialyltransferase and CMP-activated 
sialic acid. In order to determine whether there is a 
preference of the virus for a certain linkage type, two 
different transferases were used, which allow to 
compare the following oligosaccharide structures: 
Neu5,9Acz,a2,3Galé1,3GalNAc, and NeuS,9Ac,a2, 
6Galf1,4GIcNAc, respectively. A further variation was 
obtained by applying different concentrations of CMP- 


sialic acid. In this way batches of erythrocytes were 
obtained which differed in their content of Neu5,9Ac,. 
In the range of concentrations used in the present 
work, a proportional amount of Neu5,9Ac, is trans- 
ferred to the cell surface (Herrler et a/., 1992). As 
shown in Table 2, HCV-OC43 was able to recognize 
9-O-acetylated sialic acid in both the a2,3 and the #2,6 
linkage. The exact amount of sialic acid transferred 
was not determined, because CMP-Neu5,9Ac, was 
not available in radioactive form. Using erythrocyte 
preparations with different amounts of 9-O-acetylated 
sialic acid on the surface, it was possible to compare 
the efficiency of different viruses in recognizing 
Neu5,9Ac,. HCV-OC43 was found to require jess si- 
alic acid in the a2,6-linkage than BCV for agglutination 
of erythrocytes. The a2,3-linkage, on the other hand, 
was recognized more efficiently by BCV. As far as the 
preference for one of the two linkages is concerned, 
HCV-0C43 was found to resemble influenza C virus 
(strain Johannesburg/1/66) more than BCV (Table 2). 


Sequence analysis of the S gene 


To determine the sequence of the S gene of HCV- 
0C43, viral RNA was isolated, transcribed into cDNA, 
and amplified by PCR. The DNA obtained was cloned 


TABLE 2 


COMPARISON OF THE EFFICIENCY OF HCV-OC43, BCV, AND INFLUENZA 
C Virus JOHANNESBURG/ 1/66) IN RECOGNIZING 9-O-ACETYLATED SIALIC 
ACID AS A RECEPTOR DETERMINANT 


HA activity (HA units/ml) 


CMP-sialic 
acid? Influenza 
Linkage type {nmol} BCV = HCV-0C43 Cc 
a2,3 4 1024 128 612 
2 512 32 128 
1 64 <2 <2 
0.5 16 <2 <2 
2.6 8 512 128 612 
4 128 64 512 
2 <2 64 512 
1 <2 32 612 
0.5 <2 32 Bl2 
0.25 <2 32 512 
Erythrocytes from 
adult chicken _ 1024 128 1024 


* Neuraminidase-treated erythrocytes from i-day-old chicken 
were incubated as a 50% suspension in a volume of 100 ul with 
either 1,5 mU of Gal8t, 3GaiNAc e2,3 or 5 mU of Gal81, 4GIcNAc 
«2,6-sialyltransferase and the amount of CMP-activated NeuS, 9Ac, 
indicated. The resialyiated cells were used to determine the HA titer 
of BCV, HCV-OC43, and influenza C virus. For comparison the HA 
titer of the virus suspensions obtained with untreated erythrocytes 
from adult chicken is given at the bottom of the table. 
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1 AAACATGT DY TGATACTITTAATTTccrTAG CTGTTATAGGAGATITAAAGTGTACT 
71 'TCAGATACTAGTTATATTAATGATAAAGACACOGGTCCTCCTCCTATAAGTACTGATACTGTTGATGTTA 
141 CTAATGGTTTGGGTACTTATTATOTTTTAGATCOTGTGTATITAAATACTACGTITITGCTTAATGGTTA 
211 PTACCOTACTTCAGGTTCCACATATCGTAATATGGCACTGRAGGGAAGTGTACTATTGAGCAGACTATGG 
261 TTTAAACCACCATTTCIT?CTGATTTTATTAATGGTATTTTTGCTAAGGTCAAAAATACCARGGTTATTA 
351 _ AAGATGGTGTAATGTATAGTGAGTTCCCTECTATAACTATAGGTAGTACTITIGTAAATACATCCTATAG 
421 _'TGTGGTAGTACAACCACGTACAATCAATTCAACACAGGATGGTTATAATAAATTACAAGGTCTTTTAGAG 
491 GICTCTGTPIGCCAGTATAATATGTGCGAGTACCCACARACGATTTGTCATCCTAACCTGGGTAATCATC 
561 _ GCAAAGAACTATGGCATITGGATACAGGTGTTOTTTCCTCTTTATATAAGCGTAATTTCACATATGATC? 
631 GAATGCTGAPTATTTGTATTTECATTTTTATCAAGAAGGTGGTACTTTTTATGCATATTTTACAGACACT 
701 GGTCTTGTTACTAAGTTTTTGTTTAATGITTATTTAGGCATGGCGCTTTCACACTATTATGTCATGCCTC 
771‘ TGACTTGTAATAGTAAGGTTAAGAATGGTTTTACTTTAGAATATTGGGTTACTCTCTCACTTCTAGACAA 
841 _‘TATTTACTCGCTTTCAATCAAGATGGTATTATTTTTARTGCIGTIGATTGTATGAGTGATITTATGAGTG 
911 _ AGATTAAGTGTAAAACACAATCTATAGCACCACCTACTOGTGTTSATGARTTARACGGTTACACTGTTCA 
981 GCCAATCGCAGATGTTTACCGACGTAAACCTAATCTTCCCAATTGCAATATAGAAGCTTGGCTTAATGATA 
1051 AGTCGGTGCCCTCTCCATTAAATTGGGAACGTAAGACATTTICAAATTGTAATTTTARTATGAGCAGCCT 
1121 GATGTCTTTTATTCAGGCAGACTCATTTACTIGTAATAATATTGATGCTGCTARGATATATGGTATGTG? 
1191 TITTCCAGCATAACTATAGATAAGTTTGCTATACCCAATGGCAGGAAGGTTGACCTACAATTGGGTAATT 
1261 TGGGCTATTTGCAGTCATTTAACTATAGAATTGATACTACTGCAACAAGTTGTCAGTTGTATTATAATTT 


1331  ACCTGCTGCTAATGTITCIGTIAGCAGGTTTAATCCTICTACTTGGAATAAGAGATTIGGTTTTATAGAA 


1401 GATTCTGTTTTTAAGCCTCGACCTGCAGGTGTTCTTACTAATCATGATGTAGTTTATGCACAACACTOTT 


aya 


1541 


1611 _ ATTACATTTAAAGCTACAGGTACTTATAAGTGCCCCCAAACTAAATCTTTAGTTGGCATAGGTGAGCACT 


1681 GT 7 


1781  GGGTTGGTCTOCAGACTCTTGTTTACAAGGAGACAAGTGTAATATTTTIGCTAATTITATITIGCATGAT 


1821 GTTAATAGTGGTCTTACTTGTTCTACTGATTTACAAAAAGCTAACACAGACATAATICTIGGTGITTGTS 
1891 TTAATTATGACCTCTATGGTATTTTAGGCCAAGGCATTTPIGTTGAGGTTAATGCGACTTATTATAATAG 


1961 _ TTGGCAGAACCTTTTATATGATTCTAATGGTAATCTCTACGGTTTTAGAGACTACATAACAAACAGAACT 


2031 TTTATGATTCGTAGTTGCTATAGCGGTCGTGTTTCTGCGGCCTTTCACGCTAACTCTTCCGAACCAGCAT 
2101 TECTATCTCGGAATATTAAATGCAACTACGTTTTTAATAATAGTCTTACACGACAGCTGCAACCCATTAR 
2171 CTATTTTGATAGTTATCTIGETTCTGTTGTCAATGCTTATAATAGTACTGCTATTITTGTTCAAACATGT 
2242 GATCTCACAGTAGGTAGTGGTTACTGTGTGGATTACTCTAAAAACAGACGAAGTOGTGGAGCGATTACCA, 
2311  CIGGTTATCGGTTTACTAATTTTGAGCCATTTACCGTTAATTCAGTAAACGATAGTTTAGAACCTGTAGS 
2381 TGGTTTGTATGAAATTCAAATACCTTCAGAGTTTACTATAGGTAATATGGAGGAGTTTATTCARACAAGC 
2451  TCTCCTAAAGTTACTATIGATTGTGCTGCATTTGTCTGTGGTGATTATGCAGCATGTAAATCACAGTTGG 
2821  PIGGATATGGTAGTTTCTGTGATAACATTAATGCGATACTCACAGAAGTAAATGAACTACTTGACACTAC 
2591 ACAGTTGCAAGTAGCTAATAGTTTAATGAATGGTGTTACTCTTAGCACTAAGCTTARAGATGGCGTTAAT 
2661  PTCAATGTAGACGACATCAATTITTCCCCTGTATTAGGTTGTCTAGGCAGCGAATGTAGTAAAGCTTCCA 
2731 GTAGATCTGCTATAGAGGATTTACTTTTTGATAAAGTAAAGTTATCTGATGTCGGTTTTGTTGAGGCTTA. 
2801 _ TAATAATTGTACAGGAGGTGCCGAAATTAGGGACCTCATTTGTGTGCAAAGTTATARAGGCATCAAAGTS 
2871  TTGCCTCCACTGCTCTCAGAAAATCAGATCAGTGGATACACTTTGGCTGCCACCTCTGCTAGTCTATITC 
2941  CTCCTTGGACTGCAGCAGCAGGTGTACCATTTTATTTAAATGTTCAGTATCGCATTAATGGGCTTGSTCT 
3O11  CACCATGGATGTGCTAAGTCAAAATCAAAAGCTTATTGCTAATGCATTTAACAATGCCCTTTATGCTATT. 
3081  CAGGAAGGGTTCGATGCAACCAATTCIGCTTTAGTTAAAATTCAAGCTGTIGTTAATGCARATGCTGAAG 
2151 CTCTTAATAACTTATTGCAACAACTCTCTAATAGATITGGTGCTATARGTGCTICTTTACAAGAAATTCT 
3221 _ ATCTAGACTTGATGCTCTTGAAGCGGAAGCTCAGATAGATAGACTTATTAATGGTCGTCTTACCGCTCTT 
3291  AATGCTTATGTTTCTCAACAGCTTAGTGATTCTACACTGGTAAAATTTAGTGCAGCACAAGCTATGGAGA 
3361 AGGTTAATGAATGTGTCAAAAGCCAATCATCTAGGATAAATTTCTGTGGTAATGGTAATCATATTATATC 
3431 _ ATTAGTGCAGAATGCTCCATATGGTTTGTATTTTATCCACTTTAGTIATGTCCCTACTAAGTATGTCACA, 
3801 — GCGAAGGTTAGTCCCGETCTCTGCATTGCTGGTGATAGAGGTATAGCTCCTAAGAGTGGTTATTTTGTTA 
3571  ATGTAAATAATACTTGGATGTACACTGGTAGTGGTTACTACTACCTTGAACCTATAACTGAAAATAATGT 
3641 TGTTGTTATGAGTACCTGGGCTGTTAATTATACTAAAGCGCCGTATGTAATGCTGAACACTTCAATACCE 
2711 AACCTTCATGATTTTAAGGAAGAGTTGGATCAATGGTTTAAAAATCAAACATCAGTGGCACCAGATITGT 
3781 CACTTGATTATATAAATGTTACATTCTTGGACCTACAAGTTGAAATGAATAGCTIACAGGAGGCAATAAA 
3851  AGTCTTAAATCAGAGCTACATCAATCTCAAGGACATTGGTACATATGAATATTATGTAAAATGGCCTTGG 
3921 TATGTATGGCTE ITAA TCTACCTTIGCTGOTG TAGCTATGCTTGPTPTACTATTCTICATATGCTGTTGTA 
3991  CAGGATGTGGGACTAGTTGTTTTAAGAAATGTGGTGGTTGTIGNGATGATTATACTGGATACCAGGAGTT 


4061  AGTAATCAAAACTTCACATGACGACTAA, 


Fic. 2. Nucleotide sequence of the S gene from HCV-OC43. Sequences coding for the N-terminal signal peptide and the C-terminal membrane 
anchor, respectively, are underlined and printed in bold letters. The sequence coding for the potential cleavage site is bold-printed and 
underlined twice. The predicted polymorphic region in the S1 subunit of BCV (Yoo et a/., 1991) is indicated by a wavy line. 


in two steps into pBluescript and sequenced. The nu- 
cleotide sequence determined for the S gene of HCV- 
0C43 contains a single large open reading frame 
(ORF), which extends from the first ATG codon (nu- 
cleotides 5-7) to the TAA stop codon at nucleotides 
4086-4088 (Figs. 2 and 3). This ORF has a coding 
capacity of 1361 amino acids corresponding to a M, of 
150 kDa in the unprocessed form. There are 20 poten- 
tial sites for N-glycosylation in the sequence of HCV- 
0C43 in contrast to 19 in the S protein of BCV-Mebus 
{Abraham et a/., 1990). The predicted N-glycosylation 
site which is missing in BCV is located at amino acid 
position 502-504. Assuming a mean M, of 2100 Da 
per carbohydrate side chain (Hunter et a/., 1983), a M, 
of about 190 kDa is expected for the glycosylated pro- 
tein. Like the corresponding proteins of other corona- 
viruses, the S protein of HCV-OC43 shows the charac- 
teristics of a typical class | membrane protein. It has an 
N-terminal signal peptide with a hydrophobic core of 
13 amino acids and a helix-breaking glycine at position 
17 (Watson, 1984), suggesting that the signal pepti- 
dase cleaves between amino acids 17 and 18 (von 
Heijne, 1986). Close to the C-terminus, there is a 
stretch of hydrophobic amino acids (1322 to 1338) 
which is predicted to form an a-helix and to function as 
a membrane anchor (Chou-Fassman prediction; HU- 


SAR-Program, EMBL Heidelberg). By comparison with 
the S protein of related coronaviruses, the sequence 
772-Arg-Arg-Ser-Arg-Gly-778 is suggested to be the 
site for the proteolytic cleavage into the subunits $1 
and S2 (Fig. 3). This sequence is not identical to the 
proteolytic cleavage site determined for different 
strains of BCV and MHV, which contain either two 
pairs of arginines or a single arginine separated by two 
nonbasic amino acids from a pair of arginines (Schmidt 
et al, 1987; Luyties et a/., 1988; Boireau et a/., 1990: 
Parker et a/., 1990; Abraham et a/., 1990). 
HCV-0C43 has 120 amino acids, 92 in $1 and 28 in 
$2, which differ from the S protein of BCV-Mebus 
(Abraham et a/., 1990) including insertions and dele- 
tions. The resulting amino acid homology for S1 is 88% 
and for the S2 subunit 95.3%. The insertions found in 
HCV-OC43 are located within the first 256 amino 
acids, whereas deletions were found within amino 
acids 494-531. To see whether the cloned S gene is in 
a functional form, in vitro transcription/transiation ex- 
periments were performed. The capped mRNAs ob- 
tained with the transcription kit were incubated with 
white rabbit reticulocyte lysates in the presence or ab- 
sence of canine pancreatic microsomal membranes. 
[%§S]Methionine-labeled proteins were immunoprecipi- 
tated with a polyclonal antiserum directed against the 
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§ protein of BCV and analyzed on 8% SDS—polyacryl- 
amide gels (Fig. 4). Translation products were obtained 
with mRNAs derived from plasmids containing either 
the intact S gene or a fragment coding for S1. The 
faster migrating bands probably represent premature 
reaction products. The sizes of the largest proteins de- 
tected {indicated by arrows) are in good agreement 
with the molecular weights calculated for the glycosy- 
lated and unglycosylated form of the S protein (see 
above) and the S1 protein, respectively. Cleavage of S 
into the two subunits S1 and S2 was not observed in 
this system. These results indicate that the cloned S 
gene of HCV-O0C43 is in a functional form. 


DISCUSSION 


A high degree of homology was found between the S 
genes of HCV-OC43 and BCV. The overall relatedness 
between these two viruses had already been indicated 
by oligonucleotide fingerprinting which predicted a se- 
quence homology of greater than 96% ({Lapps and 
Brian, 1985). Compared to the N protein with 97.5% 
(Kamahora et a/., 1989) and the HE protein with 95% 
amino acid homology (Zhang et a/., 1992), the S protein 
of HCV-OC43 is less conserved (91.2%), especially the 
$1 subunit with a homology of 88%. The fact that the 
S2 portion of the molecule is more conserved than S1 
was also found in other coronaviruses, e.g., in IBV, 
MHV, and FIPV (for a review see Spaan et a/., 1988). 
Interestingly, HCV-OC43 shows insertions and dele- 
tions in the $1 part of the molecule, which, however, 
counterbalance each other and lead only to a loss of 
two amino acids compared to BCV. The deletions are 
all located in a part of S1 (amino acids 455-599), which 
in the case of BCV has been described as a polymor- 
phic region (Yoo et a/., 1991). It is the same region 
which in MHV-JHM is almost completely and in MHV- 
A659 partly missing (Schmidt et a/., 1987; Luytjes et al, 
1988). Taken all the sequence changes in the S protein 
of HCV-0C43 and BCV together, 25.5% of the amino 
acids are different within the polymorphic region com- 
pared to 12.2% in the rest of the $1 subunit and only 
7.7% in S2. 

The S protein of most strains of BCV and MHV is 
cleaved into the subunits S1 and S2 by a cellular pro- 
tease acting at the C-terminus of the sequence N-Arg- 
Arg-Xxx-Arg-Arg-C (Schmidt et a/., 1987; Luyties et a/., 
1988; Boireau et a/., 1990; Parker et a/., 1990; Abra- 
ham et a/., 1990; Zhang et a/., 1990). The substitution 
of Gly for Arg at position -1 of the cleavage site of HCV- 
OC43 may be responsible for the reduced cleavability 
reported for the spike protein of this virus (Hogue and 
Brian, 1986). We also observed that S was only cleaved 
to an extent of 10%, when OC43 was propagated in 
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HCV-0C43_ © 701/« AAFHANSBEP ALLSRNIKCN YVFNNGLTRQ LOPINYFDSY LGCAVNAYNS, 
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Fic. 3. The predicted amino acid sequence of the S protein from 
HCV-OC43 compared to strain Mebus of BCV (Abraham et a/., 
1990). For HCV-OC43, the complete sequence is presented: for 
BCV, only the amino acid differences are shown. The amino-terminal 
signal peptide and the carboxy-terminal membrane anchoring do- 
main are underlined and printed in bold letters. Potential sites for 
N-glycosyiation (Asn-Xxx-Ser/Thr) are indicated in the same way. 
Amino acids which are inserted or missing are marked by asterisks 
and the predicted polymorphic region in the S1-subunit of BCV (Yoo 
et ai., 1991} is marked by a wavy line. The prateolytic cleavage site is 
printed in bold letters and underlined twice. 


HRT-18 cells in contrast to BCV grown in this cell line, 
which showed nearly 80% cleavage of the S protein 
(data not shown). In the case of influenza viruses, the 
sequence Arg-Xxx-Lys/Arg-Arg has been determined 
as the consensus motif for the cleavage of the hemag- 
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Fic. 4. Analysis of the polypeptides obtained by én vitro transcrip- 
tion/translation of the S gene of HCV-OC43 and a fragment coding 
for the $1 subunit. The **-methionine-tabeled proteins were immuno- 
precipitated with a polyclonal antiserum directed against the S pro- 
tein of BCV and subjected to SDS-PAGE under denaturing condi- 
tions. The samples incubated in the presence of canine pancreatic 
microsomal membranes are indicated by (+). The size of molecular 
weight markers (kDa) is indicated on the left. 


glutinin by the cellular protease furin (Vey ef a/., 1992; 
Stienecke-Gréber et a/., 1992). This consensus se- 
quence can also account for the differential cleavage 
of the S protein of HCV-OC43, BCV, and MHV, respec- 
ively. The proteolytic cleavage of the influenza hemag- 
glutinin is an absolute requirement for the fusion activ- 
ity, and consequently for the infectivity, of the virus. As 
far as coronaviruses are concerned, proteases may 
have an enhancing effect (St. Cyr-Coats et a/., 1988). 
However, fusion activity and infectivity appear not to be 
rictly dependent on the proteolytic cleavage of S. The 
S protein of TGEV and related viruses completely lacks 
a motif for protealytic cleavage and is only found in the 
uncleaved form (Rasschaert and Laude, 1987). More- 
over, after site-directed mutagenesis of the cloned S 
gene, the surface protein of MHV was obtained in an 
uncleaved form, which nevertheless was fusion-active 
(Stauber et a/., 1993). 

We presented evidence that the S protein of HCV- 
0C43 is a hemagglutinin which interacts with 
Neu5,9Ac, present on the surface of erythrocytes. The 
HE protein was found to be unable to agglutinate 
chicken erythrocytes. Similar findings have been re- 
ported recently for BCV (Schultze et a/., 1991b), where 
the HE protein only agglutinates mouse or rat erythro- 
cytes which are very rich in surface-bound 9-O-acety- 
lated sialic acid. Thus, the S protein is the major sialic 
acid-binding protein for both HCV-OC43 and BCV. This 
may not be surprising because of the sequence similar- 
ity. There are, however, subtle differences in the affinity 
of both viruses for sialic acid. While BCV is more effi- 
cient in recognizing Neu5,9Ac, in an «2,3-linkage, 
HCV-0C43 is more effective toward an a2,6-linkage. In 


o. 
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this respect, HCV-OC43 was similar to influenza C 
virus. As both viruses infect humans, one might specu- 
late that the difference between these viruses and BCV 
in the preference for a certain linkage type is related to 
the host tropism. However, more strains have to be 
analyzed for conclusive evidence. Assuming that the 
binding site for 9-O-acetylated sialic acid is part of the 
$1 subunit, there are 92 amino acid changes which 
could be responsible for the difference between HCV- 
OC43 and BCV. This number is too high to altow a 
conclusion about the receptor-binding site. However, 
by extending this approach to more strains, it shouid 
be passible to get a clue about the location of the bind- 
ing site for 9-O-acetylated sialic acid. 
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